A Graph Skeleton Transformer Network for Action Recognition

نویسندگان

چکیده

Skeleton-based action recognition is a research hotspot in the field of computer vision. Currently, mainstream method based on Graph Convolutional Networks (GCNs). Although there are many advantages GCNs, GCNs mainly rely graph topologies to draw dependencies between joints, which limited capturing long-distance dependencies. Meanwhile, Transformer-based methods have been applied skeleton-based because they effectively capture However, existing lose inherent connection information human skeleton joints do not yet focus initial structure information. This paper aims improve accuracy recognition. Therefore, Skeleton Transformer network (GSTN) for proposed, architecture extract global features, while using undirected represented by symmetric matrix local features. Two encodings utilized feature processing joints’ semantic and centrality In process multi-stream fusion strategies, grid-search-based used assign weights each input stream optimize results. We tested our three datasets: NTU RGB+D 60, 120, NW-UCLA. The experimental results show that model’s comparable state-of-the-art approaches.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spatio-Temporal Graph Convolution for Skeleton Based Action Recognition

Variations of human body skeletons may be considered as dynamic graphs, which are generic data representation for numerous real-world applications. In this paper, we propose a spatio-temporal graph convolution (STGC) approach for assembling the successes of local convolutional filtering and sequence learning ability of autoregressive moving average. To encode dynamic graphs, the constructed mul...

متن کامل

Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition

Dynamics of human body skeletons convey significant information for human action recognition. Conventional approaches for modeling skeletons usually rely on hand-crafted parts or traversal rules, thus resulting in limited expressive power and difficulties of generalization. In this work, we propose a novel model of dynamic skeletons called SpatialTemporal Graph Convolutional Networks (ST-GCN), ...

متن کامل

Extended Investigations on Skeleton Graph Matching for Object Recognition

Shape similarity estimation of objects is a key component in many computer vision systems. In order to compare two shapes, salient features of a query and target shape are selected and compared with each other, based on a predefined similarity measure. The challenge is to find a meaningful similarity measure that captures most of the original shape properties. One well performing approach calle...

متن کامل

Two-Stream 3D Convolutional Neural Network for Skeleton-Based Action Recognition

It remains a challenge to efficiently extract spatialtemporal information from skeleton sequences for 3D human action recognition. Although most recent action recognition methods are based on Recurrent Neural Networks which present outstanding performance, one of the shortcomings of these methods is the tendency to overemphasize the temporal information. Since 3D convolutional neural network(3D...

متن کامل

Leveraging the Path Signature for Skeleton-based Human Action Recognition

Human action recognition in videos is one of the most challenging tasks in computer vision. One important issue is how to design discriminative features for representing spatial context and temporal dynamics. Here, we introduce a path signature feature to encode information from intra-frame and inter-frame contexts. A key step towards leveraging this feature is to construct the proper trajector...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Symmetry

سال: 2022

ISSN: ['0865-4824', '2226-1877']

DOI: https://doi.org/10.3390/sym14081547